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© A leader sequence to promote the secretion of gene products. 
© A recombinant DNA transfer vector contains a leader 
sequence polynucleotide which codes for a signal polypep- 
tide of formula I, 

Met - Arg - Pro - Ser- lie - His - Arg - Thr - 

Ala - He - Ala - Ala - Val - Leu - Ala -Thr- I 

Ala - Phe - Val - Ala - Gly - Thr 

Preferably the transfer vector is a plasmid. In one pre- 
ferred embodiment the leader sequence polynucleotide is 
downstream of and in reading phase with a bacterial or yeast 
^ promoter and a ribosome binding she, and upstream of and in 
reading phase with a structural gene. The structural gene may 
^ be, for example, the carboxypeptidase Gi(CPGi) gene from 
If) the chromosomal DNA of Pseudomonas species strain RS - 
16. Examples of plasmids containing the leader sequence 
polynucleotide and the CPGi gene are pNM1, pNM111, 
PNM14, pNM21, pNM22, pNM3l, pNM32 and pLEC3 
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LBAHER SEQUENCE TO PROMOTE TEE SECHETION OP GEHE PfiOTWyPS 

The present invention relates to fragments of specific 
deoxyribonucleotide sequences that promote the secretion of gene 
products from cells and in particular to recombinant DM transfer 
vectors that oontain these fragments. 
5 Recent developments in biochemistry have led to the construct- 

ion of recombinant MA transfer vectors in which, transfer vectors, 
for example plasmids, are made to contain exogenous MA. In some 
cases the recombinant incorporates heterologous MA that codes for 
polypeptides that are ordinarily not produced by the organism 
10 susceptible to transformation by the recombinant vehicle. 

In its basic outline a method of endowing a micro organism 
with the ability to synthesise a new protein involves three., 
general steps: 

(a) isolation and purification of the specific gene or nucleo- ' 
15 tide sequences containing the genetically coded information 

for the amino acid sequence of the desired protein or 
polypeptide, 

recombination of the isolated gene or nucleotide sequence 
with an appropriate transfer vector, typically MA of a 
bacteriophage or plasmid to form a recombinant transfer 
vector that codes, in part, for the production of the 
deBired protein or polypeptide, 
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(b) 
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(c) transfer of the vector to the appropriate micro organism 
and selection of a Btraln of the recipient micro organism 
containing the desired genetic information. 
Provided the gene or nucleotide sequence expresses its 
5 protein or polypeptide in the chosen micro organism, growth of the 
micro organism should then produce the desired protein or poly- 
peptide in significant quantities. 

Once the micro organism has been cultured, the protein or 
polypeptide must be isolated from the undesired materials. This 
10 step is considerably facilitated if the majority of the desired 

protein or polypeptide is present in the culture medium and/or the 
periplasmic Bpace of the micro organism. In other words purifica- 
tion may be performed in a more efficient manner if, once 
expressed, the protein or polypeptide passes through the cell 
15 membrane and out of the cytoplasm. 

The passage of the protein or polypeptide through the cell 
membrane is desirable for two main reasons. First the desired 
protein or polypeptide will generally be foreign to the micro 
organism in which it iB expressed. In many cases, therefore, it 
20 will be quickly broken down by proteolytic enzymes etc in the cells 
cytoplasm and will, subsequently, have a short half life within the 
cell. By transferring the protein or polypeptide out of the cyto- 
plasm soon after expression the stability of the protein or poly- 
peptide will be greatly increased. Second the number of unwanted 
25 genetic materials and products (from which the desired protein or 
polypeptide must be isolated) will be far greater in the cell's 
cytoplasm than in the culture medium and/or in the cell , s peri- 
. plasmic space. It can be seen that on both of the above counts 
the transfer of the protein or polypeptide through the cell membrane 
30 and out of the cytoplasm will greatly facilitate protein or poly- 
peptide isolation. 

One way in which the secretion of gene products from the 
cell 1 s cytoplasm may be promoted is to produce, within the cyto- 
plasm, a preprotein or prepolypeptide in which the desired protein 
35 or polypeptide is preceded by a signal polypeptide. The pre- 
dominantly hydrophobic signal polypeptide directB the desired 
protein or polypeptide to the cell's periplasmic space, where the 
signal peptide is removed as the desired protein or polypeptide 
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traverses the cell membrane. 

Many of the known signal peptides contain cysteine residues. 
These residues have been found to react in the cell membrane and 
thereby inhibit the efficient transfer of the desired gene product 
out of the cell. 

It is the prijnary object of the present invention to provide 
recombinant UNA transfer vectors containing a leader sequence poly- 
nucleotide that codes for a signal peptide that is cysteine free. 
Other objects and advantages of the present invention will become 
apparent from the following description thereof. 

According to the present invention there is provided a re- 
combinant MA transfer vector conprising a leader sequence poly- 
nucleotide coding for signal polypeptide of formula I 

Ket-Arg-Pro-.Ser-lle-His^Arg-Th2vAla-lle--Ala- I 
1 5 Ala-7al-Leu-Ala-Thr-Ala-Phe-Val-na-Gly-Thr 

The transfer vector may be a bacteriophage or, which is 
preferred, a plasmid. 

Preferably the majority of the codons in the nucleotide 
sequence are those preferred for the expression of microbial 
20 genomes. Suitable codons are listed in UK 1,568,047 and UK 

2007675A, and these publications are therefore incorporated herein 
by reference. 

In one preferred embodiment of the present transfer vector the 
nucleotide sequence has formula II 
25 5 f - ATG CGC CCA TCC ATC CAC CGC ACA 

GCC ATC GCC GCC GTG CTG GCC ACC II 
GCC TTC CTG GCG GGC ACC - y 
The nucleotide sequence coding for the signal polypeptide (the 
leader sequence poly nucleotide) will preferably be downstream of and 
in reading phase with a bacterial or a yeast promoter and a prokary- 
otic ribosome binding site in the transfer vector. Moreover the 
leader sequence polynucleotide will either be upstream of an insert- 
ion site for a structural gene or, which is preferred, will be up- 
stream of and in reading phase with a structural gene coding for a 
desired protein or polypeptide. Preferably the gene codes for a 
eukaryotic, particularly a mammalian, protein pr polypeptide. 

The structural gene may code, for example, for such eukaryotic 
proteins as human growth hormone, human insulin or human chorionic 
somatcmiammotropin. Alternatively it may code for such prokaryotic 
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proteins as E.coli jB-galactoaidase or Pseudomonas carboxy peptidase 
G 2 (^2^ (Carbarypeptidase ^ is an enzyme, produced by Pseudomonas 
species strain BS-16, that has application in cancer chemotherapy. 
It is a Zn 2+ containing dimer of 2 x 42,000 daltons and has high 

5 affinities (Km Tallies of 10~^ or 10" 6 )*) for both 5-methyltetra- 

hydrofolate, the predominant ciculatory form of folate in mammals and 
for the folic acid antagonist methotrexate (HEX), which is widely used 
in cancer chemotherapy. The enzyme may be used directly for the 
plasma depletion of reduced folates, essential as co-factors in purine 

10 and particularly in pyrimidine biosynthesis. CPG 2 has been shown to 
inhibit the development of the Walker 256 carcinoma in vivo and to 
remove MUX from circulation in patients where prolonged exposure to 
high doses of KCX leads to toxicity). 

Examples of transfer vectors according to the present invention 

15 that code for CPGg are pNM1, pNM111, pHM14» pNM21, pNM22, pHM31,pHM32 
and pLEC3. 

The promoter is preferably a high expression bacterial or yeast 
promoter for the structural gene in a variety of hosts. The particular 
choice of promoter will depend on the microorganism to be transformed. 

20 For example the transformation of E.coli will generally be effected by 
a transfer vector in which an E.coli promoter controls the expression 
of the structural gene. Examples of E.coli promoters are those present 
in the plasmids p3R 322 and pA2» 153. By contrast, the transformation 
of Pseudomonas species will generally be effected by a transfer 

25 vector in which a Pseudomonas promoter controls the expression of the 
structural gene. Examples of Pseudomonas promoters are those present 
in the plasmid pET 230 or Pseudomonas chromosom a l DKA.. 

In order to express the structural gene the present transfer 
vector will be transformed into a suitable microorganism. According 

30 to a further aspect of the present invention therefore -there is 

provided a microorganism transformed by a recombinant BKA transfer 
vector according to this invention. The microorganism will preferably 
be a bacterium or yeast in which high expression of the structural 
gene, within the transfer vector, occurs. Depending on the choice of 

35 promoter the microorganism may be a strain chosen from one of the 
following bacteria E.coli, Pseudomonas and Bacillus or the yeast 
Saccharomyces cerevisiae. 

Having transformed the microorganism, the protein or polypeptide, 
for which the structural gene codes, may then be expressed, by culturing 
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the transfo23ned microorganism in a culture medium. It is the primary 
advantage of the present invention that culturing the transformed 
microorganism affords a preprotein or prepolypeptide in which the 
desired protein or polypeptide is preceded by the present Bignal 
polypeptide. This means that soon after expression the signal 
polypeptide directs the deBired protein or polypeptide to the cell f s 
periplasmic space, where the signal polypeptide is removed as the 
desired protein or polypeptide traverses the cell membrance. Since 
the present signal polypeptide is free of cysteine residues the 
desired gene product will be efficiently secreted through the membrane. 

The present transfer vectors may be prepared by any of the methods 
that are well known in the recombinant MA art. For example the 
leader sequence poly nucleotide may be synthesiBed by the modified 
triester method of K.Itakura etal, J ACS , 1975, £7, 7327 or by the 
improved oligodeoxynucleotide preparation described in UK 2007675A. 
The disclosure of both of these references is incorporated herein by 
reference* The synthesised polynucleotide may then be inserted in a 
transfer vector, preferably a plasmid. In the transfer vector it will 
preferably be downstream of and in reading phase with a bacterial or a yeast 
promoter and a prokaryotic ribosome binding site. The leader sequence 
polynucleotide should also be either upstream of a structural gene 
insertion site or upstream of and in reading phase with a structural 
gene. 

Alternatively, DNA fragments containing the leader sequence 
polynucleotide may be obtained from natural sources, in particular from 
the chromosomal DNA of Pseudomonas species strain BS-16, In this 
particular caBe a polynucleotide (formula II above) coding for the 
present signal polypeptide immediately precedes a structural gene 
coding for CPG^ A number of the MA fragments containing this leader 
sequence polynucleotide may therefore be recognised by their ability, 
on insertion into a plasmid and transformation of a microorganism by 
the resultant recombinant vector, to enable a microorganism to grow on 
folate. Examples of such recombinant transfer vectors that contain both 
a polynucleotide coding for the present signal polypeptide(foxniula II 
above) and a Btructual gene coding for CPG 2 are pM1, pNM111, pNM14, 
pNM21, pNM22, pHM31, pKM}2 and pLEC3. Of coarse, aice a Fol+ recombinant 
vector has been obtained in this way it may be subcloned to afford 
alternative vectors (either Fol + or Pol"") that also contain a 
polynucleotide coding for the present signal polypeptide. 
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Once a suitable MA fragment has been isolated it may then be 
inserted in a transfer vector, preferably a plasmid. In the transfer 
vector the leader sequence polynucleotide on the inserted fragment 
should be downstream of and in reading phase with a bacterial or a yeast 

5 promoter and a prokaryotic ribosome binding site. The leader sequence 
polynucleotide should "also be either upstream of a structural gene 
insertion site or upstream of and in reading phase with a structural gene. 

The structural gene for insertion downstream of and in re ading 
phase with the present leader sequence polynucleotide may be obtained, 

10 for example, by the synthetic methods mentioned above (this is 

particularly useful for the preparation of genes co ding for wnaij 
proteins, such as human growth hormone, ;?Timri,i7i and human chorionic 
somato mamm otropin. ) Alternatively the structural gene may be prepared 
from m=ENA by the use of the enzyme reverse transcriptase or may be 

15 isolated from natural Bources (chromosomal MA). 

An example of the latter method is the isolation of MA fragments 
containing a polynucleotide sequence (shown in Table 1) co ding for the 
enzyme CPG^ (amino acid sequence also Bhown in Table 1) from 
Pseudomonas species strain RS-16 chromosomal MA. Examples of plasmids 

20 containing a CP& 2 structural gene, as well as a polynucleotide coding 
for the present signal polypeptide (formula II above), are pNM1, pHM111, 
pNM14t pNH21, pNK22, pKMJI , pNK32 and pIECJ. 

Once prepared or isolated the Leader sequence polynucleotide and 
the structural gene will be inserted into a transfer vector, preferably 

2 5 a plasmid, to form a recombinant MA transfer vector according to the 
present invention. The insertion step or steps will preferably be 
effected by one of the well known techniques in this art that employ 
restriction endonucleases, see for example the methods discussed in 
UK 2O906OOA, the disclosure of which is incorporated herein by 

3° reference. The choice of transfer vector will be determined by the 
microorganism in which the leader sequence polynucleotide and 
structural gene are to be expressed. Generally the transfer vector 
will be a cloning vehicle that is suitable for transforming the chosen 
micro-organisms and that displays a phenotypical characteristic, such 

35 as antibiotic resistance, by which the recombinant transfer vectors may 
be selected. Thus, if the micro-organism is to be E-coli, then 
suitable transfer vectors will be the E-coli plasmids pBR322 and 
pAT155« Alternatively, if the micro-organism is to be Pseudomonas, 
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A Polynucleotide Sequence, coding for CPGg, isolated froa 
Pseudononaa specie b Btraln HS - 16 chromosomal S NA 

1 

Met Arg Pro Ser lie Kie Arg Thr 

5 1 - ATO CGC CCA TCC ATC CAC CGC ACA 
10 

Ala lie Ala Ala Val Leu Ala Thr Ala 

GCC ATC GCC GGC GTG CTG GCC ACC GCC 

20 

Phe Val Ala Gly Thr Ala Leu Ala- Gin 

TTC GTG GCG GGC ACC GCC CTG GCC CAG 

30 

Lys Arg Asp Asn Val Leu Phe Gin Ala 

AAG CGC GAC AAC GTG CTG TTC CAG GCA 

40 

Ala Thr Asp Glu Gin Pro Ala Val lie 

GCT ACC GAC GAG CAG CCG GCC GTG ATC 

50 

Lys Thr Leu Glu Lys Leu Val Asn lie 

AAG ACG CTG GAG AAG CTG GTC AAC ATC 

60 

Glu Thr Gly Thr Gly Asp Ala Glu Gly 

GAG ACC GGC ACC GGT GAC GCC GAG GGC 



70 

Leu Glu 



He Ala Ala Ala Gly Asn • Phe 

ATC GCC GCT GCG GGC AAC TTC CTC GAG 

80 

Ala Glu Leu Lys Asn Leu Gly Phe Thr 

GCC GAG CTC AAG AAC CTC GGC TTC ACG 

Val Thr Arg Ser Lys Ser Ala Gly Leu 

GTC ACG CGA AGC AAG TCG GCC GGC CTG 
90 

Val Val Gly Asp Asn lie Val Gly Lys 

GTG GTG GGC GAC AAC ATC GTG GGC AAG 
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lie 
ATC 

Leu 
CTG 

Tyr 
TAC 



100 
lys 

AAG 

Leu 
CTG 

Leu 
CTC 



Gly 
GGC 

110 
Met 

ATG 

Lys 
AAG 



Arg 
CGC 

Ser 

TCG 

120 
Gly 

GGC 



Gly 
GGC 

His 
CAC 

lie 

ATT 



Gly 
GGC 

Met 
ATG 

Leu 
CTC 



Lya 
AAG 

Asp 
GAC 

Ala 
GCG 



Aan Leu 

AAC CTG 

Thr Tal 

ACC GTC 

Lys Ala 

AAG GCC 



Pro 
CCG 



Phe 
TTC 



Arg 
CGC 



Val 
GTC 



150 
Glu 

GAA 



Gly 
GGC 



Asp 

GAC 



Lys Ala 
AAG GCC 



Tyr 
TAC 



Gly 
GGC 



Pro 
CCG 



Gly 
GGC 



lie 

ATC 



140 

Ala 

GCC 



Asp 
GAC 



Asp 
GAC 



Ly3 
AAG 



Gly 
GGC 



Gly 
GGC 



Asn 
AAC 



Ala 
GCG 



Val 
GTC 



lie 
ATC 



150 
Leu 

CTG 



His 
CAC 



Thr 
ACG 



Leu 
CTC 



Lys 
AAG 



Leu 
CTG 



Leu 
CTG 



Lys 
AAG 



Glu 
GAA 



Tyr 
TAC 



160 

Gly 

GGC 



Val 

GTG 



Arg 
CGC 



Asp 
GAC 



Tyr 
TAC 



Gly 
GGC 



Thr 
ACC 



lie 
ATC 



Thr 
ACC 



Val 
GTG 



170 
Leu 

CTG 



Phe 
TTC 



Asn 
AAC 



Thr 
ACC 



Asp 
GAC 



Glu 
GAG 



Glu 
GAA 



Lys 
AAG 



Gly 
GGT 



Ser 
TCC 



180 
Phe 

TTC 



Gly 

GGC 



Ser 
TCG 



Arg 
CGC 



Asp 
GAC 



Leu 
CTG 



lie 
ATC 



Gin 
CAG 



Glu 
GAA 



Glu 
GAA 



i90 
Ala 

GCC 



Lys 
AAG 



Leu 
CTG 



Ala 
GCC 



Asp 
GAC 



Tyr 
TAC 



Val 
GTG 



Leu 
CTC 



Ser 
TCC 



Phe 
TTC 



200 
Glu 

GAG 



Pro 
CCC 



Thr 
ACC 



Ser 
AGO 



Ala 
GCA 



Gly 
GGC 



Asp 
GAC 



Glu 
GAA 



lys 
AAA 



Leu 
CTC 



210 
Ser 

TCG 



Leu 
CTG 



Gly 
GGC 



Thr 
ACC 
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Ser Gly 
TCG GGC 



lie 
ATC 



Ala 
GCC 



Tyr 
TAC 



Val 

GTG 



220 
Gin 

CAG 



Val 

GTC 



Asn 
AAC 



lie 
ATC 



Thr 
ACC 



Gly 
GGC 



Lye 
AAG 



Ala 
GCC 



Ser 
TCG 



Els 
CAT 



230 
Ala 

GCC 



Gly 
GGC 



Ala 
GCC 



Ala 
GCG 



Pro 
CCC 



Glu 
GAG 



Leu 
CTG 



Gly 
GGC 



Val 
GTG 



Asn 
AAC 



240 
Ala 

GCG 



Leu 
CTG 



Val 
GTC 



Glu 
GAG 



Ala 
GCT 



Ser 
TCC 



Asp 
GAC 



Leu 
CTC 



Val 

GTG 



Leu 
CTG 



250 
Arg 

CGC 



Thr 
ACG 



Met 
ATG 



Asn 
AAC 



lie 

ATC 



Asp 
GAC 



Asp 
GAC 



Lys 
AAG 



Ala 
GCG 



Lys 
AAG 



260 
Asn 

AAC 



Leu 
CTG 



Arg 
CGC 



Phe 
TTC 



Asn 
AAC 



Trp 
TGG 



Thr 
ACC 



lie 

ATC 



Ala 
GCC 



Lys 
AAG 



270 
Ala 

GCC 



Gly 
GGC 



Asn 
AAC 



Val 

GTC 



Ser 
TCG 



Asn 
AAC 



lie 
ATC 



lie 

ATC 



Pro 
CCC 



Ala 
GCC 



280 
Ser 

AGC 



Ala 
GCC 



Thr 
ACG 



Leu 
CTG 



Asn 
AAC 



Ala 
GCC 



Asp 
GAC 



Val 
GTG 



Arg 
CGC 



Tyr 
TAC 



290 
Ala 

GCG 



Arg 
CGC 



Asn 
AAC 



Glu 
GAG 



Asp 
GAC 



Phe 
TTC 



Asp 
GAC 



Ala 
GCC 



Ala 
GCC 



Met 
ATG 



300 
Lys 

AAG 



Thr 
ACG 



Leu 
CTG 



Glu 
GAA 



Glu 
GAG 



Arg 
CGC 



Ala 
GCG 



Gin 
CAG 



Gin 
CAG 



Lys. 
AAG 



310 
Lys 

AAG 



Leu 
CTG 



Pro 
CCC 



Glu 
GAG 



Ala 
GCC 



GAC 



Val 
GTG 



10 



Lys 
AAG 



Val 
GTG 



lie 
ATC 



320 
Val 

GTG 
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Thr Arg Gly 
ACG CGC GGC 



Arg 
CGG 



Pro 
CCG 



Ala 
GGC 



Phe 
TTC 



Asn 
AAT 



Ala 
GGC 



330 
Gly 

GGC 



Glu 
GAA 



Gly 
GGC 



Gly 
GGC 



Lys 
AAG 



Lys 
AAG 



Leu 
CTG 



Val 

GTC 



Asp 

GAC 



Lys 
AAG 



340 
Ala 

GCG 



Val 
GTG 



Ala 

GGC 



Tyr 
TAC 



Tyr 
TAC 



Lys 
AAG 



Glu 
GAA 



Ala 

GCC 



Gly 
GGC 



Gly 
GGC 



350 
Thr 

ACG 



Leu 
CTG 



Gly 
GGC 



Val 

GTG 



Glu 
GAA 



Glu 
GAG 



Arg 
CGC 



Thr 
ACC 



Gly 
GGC 



Gly 
GGC 



360 
Gly 

GGC 



Thr 
ACC 



Asp 
GAC 



Ala 
GCG 



Ala 
GCC 



Tyr 
TAC 



Ala 
GCC 



Ala 
GCG 



Leu 
CTC 



Ser 
TCA 



370 

Gly 
GGC 



Lys 
AAG 



Pro 
CCA 



Val 
GTG 



lie 
ATC 



Glu 

GAG 



Ser 
AGO 



Leu 
CTG 



Gly 
GGC 



Leu 
CTG 



380 
Pro 

CCG 



Gly 
GGC 



Phe 
TTC 



Gly 
GGC 



Tyr 

TAC 



His 
CAC 



Ser 
AGO 



Asp 

GAC 



Lys 
AAG 



Ala 
GCC 



390 
Glu 

GAG 



Tyr 

TAC 



Val 
GTG 



Asp 
GAC 



lie 

ATC 



Ser 
AGC 



Ala 
GCG 

Ala 
CGC 



lie 
ATT 

Arg 
CGC 



Pro 
CCG 

Leu 
CTG 



Arg 
CGC 

lie 
ATC 



400 
Arg 

CGC 

Met 
ATG 



Leu 
CTG 

410 
Asp 

GAT 



Tyr 
TAC 

Leu 
CTG 



Met 
ATG 

Gly 
GGC 



Ala 
GOT 

Ala 
GCC 



Gly 
GGC 



Lys 
AAG 



TGA - 3« 
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Amino acids 1 to 22 are the present signal polypeptide 
Amino acids 23 to 415 are the CPGg structural gene 

HB The leader sequence polynucleotide is the preferred polynucleo- 
tide of formula II. 
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then a suitable transfer vector will be Pseudomonas pkI230. 

The present recombinant USA. transfer vectors, micro-organisms 
transformed by the present recombinant MA transfer vectors and 
processes for the preparation of said vectors and micro-organisms will 
now be described by way of example only, with particular reference to 
the Figures in which: 

Figure 1 is a restriction enzyme cleavage Bite map of pHM1 , 
Figure 2 is a restriction enzyme cleavage site map of pHM111, 
Figure 3 is a restriction enzyme cleavage Bite map of pHM14, 
Figure 4 is a restriction enzyme cleavage site map of pHM21, 
Figure 5 is a restriction enzyme cleavage site map of pNM22, and 
Figure 6 illustrates the process for the preparation of a recombinant 

^aStoSa^rullur2 B geSr^ le ^ er ^ Cleotide — «- 

Figure 7 is a restriction enzyme cleavage site map of pEEC3. 
15 Materials and Methods 

Bacterial strains and plasmids 

The bacterial strains used were Escherichia coli W5445 (pro 
leu tM thr° sup E44 lac Y ton A r~ m~ Str r ) Pseudomonas putida 
2440 (r") and Pseudomonas sp strain BS-16. The plasmids employed were 
20 pBR322 (F Bolivar et al Gene, 1977, 2, 95), pAT153 (A J Twigg et al, 

Hature, 1980, 283, 216) and P KT230 (M Bagdasarin et al , Gene 1981, 16, 
237J and pR0G5 (R.F.Sherwood et al , The Molecular Biology of Yeast, 1979 
Cold Spring Harbor Publications) . 
Media and culture conditions 

E - col± waB routinely cultured in L-broth (1% tryptone, 0.5% yeast 
25 extract, 0.5%Ha3l). Solidified medium (L-agar) consisted of L-broth 
with the addition of 2% (w/v) agar (Bacto-Difco). Antibiotic con- 
centrations used for the selection of transf ormants were 50 ug/ml 
ampicillin, 15 ug/ml tetracycline and 30 ug/ml kanamycin. In the case 
of E.coli these were conducted in 2YT liquid medium (1.6% tryptone, 
30 1% yeast extract, 0.5% NaCl) containing 1% glucose, and 0.05% folate 
where appropriate. The pseudomonads were grown in a minimal Baits 
solution consisting of per litre: MgSO^ 0.05g; CaClg, 2^0, 0.05g; 
FeS0 4 .7H 2 0, 0.005g; MnSO^ 0.0015g; Na,^, 2^0, 0.0015g; KH^; 5g; 
K 2 HP0 4:5 H 2 0 ' 12g; Slutamate, 10g. The mfnlTTinl medium employed for 
35 E.coli was M9 medium (j Miller, Experiments in molecular genetics, 
Cold Spring Harbor laboratory, Cold Spring Harbor, Hew York, 1972). 
Purification of UNA 

Plasmids were purified from chloramphenicol amplified cultures 
(D B Clewell, J Bacterid, 1972, 110, 667) by Brij-lysis (B B Clewell 
et al, Proc Hatl Acad Sci, USA, 1969, 62, 1159) end subsequent caesium 
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chloride-etMdium bromide density gradient centrifugation (A Colman 
. et al f Eur* J Biochem, 1978 f 91 1 303) • A rapid, Fm P n Bcale plasmid 
isolation technique (Burnboim et al f Huc.Acids Res, 1979, 7, 1513) was 
also employed for screening purposes. Chromosomal UNA from the donor 
5 Pseudomonad strain (RS-16) was prepared essentially as described by 
J Mannar, J.Mol.Biol, 1961, 3, 208* 
Ee8triction, ligation and transformation methods 

Restriction endonucleases and DBA ligase were purchased from Bethesda 
Research Laboratories and used in the buffers and under the conditions 

10 recommended by the supplier. Transformation of E.coli was essentially 
as described by S N Cohen et al., Proc.Hatl.Acad.Sci., USA, 1972, 69, 
2110 f while Ps.putida was transformed by the method of M Bagdasarian and 
Z H Timmis, Current Topics in Microbiology and Immunology, Eds 
P H Hof Schneider and V Goebel, Springer Verlag, Berlin, 1981, p 47. 

15 Agarose gel electrophoresis 

Digests were electrophoresed in 0.8# agarose slab gels (10 cm x 20 cm x 
0.5 cm) on a standard vertical system (Raven), employing Tris-borate- 
EDTA buffer. Electrophoresis of undigested DNA was at 125V, 50 mA for 
3 hours, while digested UNA was electrophoresed at 15V, 10 mA for 

20 16 hours. Fragment sizes were estimated by comparison with fragments 
of TlDNA digested with Eindl ll and XUNA cut with both Hindl ll and 
EcoR I. Fragments were isolated from gels using electroelution 
(M V McDonnell et al, Proc. Nat 1. Acad. Sci, USA, 1977, 74, 4835). 
Determination of carboxypeptidase activity 

25 Bacteria were grown in 1 litre batch culture and 100 ml samples taken 
at various stages in the growth phase. Samples were cooled on ice, 
centrifuged at 13,000 x g for 10 minutes and resuspended and frozen 
in 5 ml of 0.1 M Tris HC1, pH 7.3 containing 0.2 mM ZnSO^. The cells 
were disrupted using a MSB Ultrasonic Disintegrator (150 V) at medium 

30 frequency, amplitude 2, for three 30- second intervals on ice. Cell 

debris was removed by centrifugation at 10,000 x g for 5 minutes. 

CPG 2 activity was determined after J L McCullough et al, J.Biol. Chem, 

1971, 246, 7207. A.lal reaction cuvette containing 0.9 ml of 0.1 M 

Tris-HCl,pH 7.3 plus 0.2 mM ZnSO, and 0.1 ml of 0.6 mM methotrexate was 
o 4 

35 equilibrated at 37 C. Enzyme extract was added to the test cuvette 
and the decrease in absorbance at 320 ran measured using a Pye-Unicam 
SP1800 double-beam spectrophotometer. Enzyme activity per ml extract 
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was calculated as A 320 nm absorbance/min divided by 8.3i which 
i8 equivalent to the hydrolysis of 1 jimol of MEX/min at 37°C. 
Protein concentration was determined by the method of MM Bradford, 
Anal Biochem, 1976, 72, 248. 

5 Cell fractionation techniques 

Bacterial cultures were grown in the low phosphate medium of H C 
Keu and L A Heppel, (J Biol Chem, 1964> 240, 3685), supplemented 
with 100 Jig/ml ampicillin, to an OD^^ = 1,0. 40 ml of culture was 
centrifuged at 5000 g for 10 min, washed in 5 ml of 10 mM Tris-HCl 

10 pH 7-0, and resuspended in 0.9 ml 0.58 K sucrose, 0.2 mM DTT, 30 mM 
Tris-HCl pH 8.0. Conversion to spheroplasts was achieved by the 
addition of 20 pi of lysozyme (2 mg/ml), 40 jtL 0.1 K EDTA, and 
incubation at 23° for 10 min (HC Neu et al, J Biol Chem, 1964, 239, 
3893)* The spheroplasts were placed on ice and 0.1 ml of 3096 (w/v) 

15 BSA added, followed by 5 ml of sucrose-tris buffer. Sedimentation 
of the spheroplasts was achieved by centrifugation at 5000g for 10 
min and the supernatant retained as the ^periplasmic' fraction. 
The pellet was resuspended in 5 ml 10 mM Tris-HCl, 0.2 mK DTT pH 
7.0 and and sonicated at 20 Zc/sec, 2 imps for 15 sec. Remaining 

20 whole cells were removed by centrifugation at 1000 x g for 10 min. 
Centrifugation at 100000 x g for 1 hr, at 4°C, separated the soluble 
(cytoplasmic) proteins from the particulate (membrane-bound) 
proteins. The membrane pellet was resuspended in 1 ml of 10mM 
TriB-HCl, 0.2 mM DTT, pH 7.0. 

25 CPG 2 was assayed as described* Alkaline phosphatase was assayed 
according to J Killer, Experiments in Molecular genetics, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, Kew York, 1972, NADH 
oxidase according to M J Osborn et al, j Biol Chem, 1972, 247, 3962 
and glyceraldehyde - 3 - phosphate dehydrogenase after K Suzuki et 
30 al, FSMS, 1971 f 13, 217. 
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Example 1 

Preparatio n of recombinant plasmid p NMl (A plasmid containing both 
the present leader sequence polynucleotide and the CPG 2 structural 
gene) 

fPo isolate the gene for carboxypeptidase G g together with the 
leader sequence polynucleotide chromosomal UNA prepared from the 
PseudomonaB host (strain BS-16) was partially digested with Sau3 A 
and fragments of between 6-8 Md isolated from agarose gels by 
electroelution. The ■ sized* MA was ligated with alkaline phosph- 
atase treated BamHl cut pBR322, transformed into E.coli V5445, and 
Ap r transformants selected. Of the 3,500 Ap r colonies obtained, 
approximately 70% were TC B . Utilisation of a rapid plasmid isolation 
technique on 50 Ap r TC B transformants demonstrated that 90% of the 
gene bank harboured plasmids of the expected size. As a further 
15 check on the authenticity of the gene bank, the individual clones 
were screened for the acquisition of a Leu + phenotype. Two such 
clones were identified. Both carried a plasmid capable of trans- 
forming leuB (B-isopropylmalate dehydrogenase) E.coli mutants to 
prototrophy. 

20 Acquisition of a functional CPG 2 gene should enable E.coli 

to utilise folic acid as a carbon source. The 2,400 gene b ank 
clones were screened for the ability to grow on ™^m a i me dium 
containing folate as the sole source of carbon (ie Pol + ). A 
single Pol clone was detected and shown to harbour a plasmid 

25 capable of transforming plasmid-minus V5445 to the Pol + phenotype. 
Classical restriction mapping of this plasmid (pNMl) was under- 
taken which revealed the presence of a 5.9 M insert of pseudom- 
onad UNA within pBR322. The restriction enzyme clearage site map 
of pNMl is given in Pigure 1. The nucleotide sequence of the 

30 leader sequence polynucleotide and the CPG 2 structural gene is 
given in Table 1 . 
Example 2 

Subclonlng of plasmid pML to form pNMlll 

In order to pinpoint the position of the CPG 2 gene and the 
35 leader sequence polynucleotide within the 5.9 Md insert, sub- 
cloning of various restriction enzyme fragments, into pBR322, was 
undertaken. A functional CPG^ gene was shown not to occur on 
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Xhol or Sphl fragment b of tne pHKL insert, but was present on a 
3.1 Md Bglll fragment. This latter fragment was cloned into the 
BamEL site of pBR322 to give pNMll (6.0 Mi), a further reduction 
in the size of pNMll vas achieved by digesting with Sail and 
5 religating the resultant fragment to yield pNMlll. In addition, 
plasmids in which the smaller 0.95 Md Sail fragment had become 
inserted in the opposite orientation to the parent plasmid (pNMll) 
were Fol~. Taken together these subcloning results indicate that 
the CP& 2 gene and the leader sequence polynucleotide lie between 
10 the Bglll site at 4.14 and the SA11 site at 6.03 on pNMl. 

Furthermore, the gene contains a Sphl (5«17)* Sail (5.07) and 
at least one Xhol (4-56 and/or 5-56) site. The restriction 
enzyme cleavage site map of pHMlll is given in. Figure 2. 
Example 3 

15 Preparation of recombinant plasmid pNK14« (A plasmid containing 

both the present leader sequence polynucleotide and the CPG^ 

structural gene ) 

The 3.1 Md Bgl II fragment from Example 2 above was partially 

digested with Sau3 A. These fragments were then cloned into the 

20 Bam HI site of pAT153 and transformed into E coli W5445. Of the 
r s *i~ 

two Ap Tc Fol colonies obtained, one contained a plasmid which 
had acquired an extra Sal I and Bam HI site, this was pNM 14. 
The restriction enzyme cleavage site map of pKM 14 is given in 
Figure 3» Sequencing of the leader sequence polynucleotide and 

25 the CPG 2 structural gene present in pNM 14 gave the nucleotide 

structure shown in Table 1. DM sequencing of pNM 14 also revealed 
that the Sal I - Bam EE fragment was a duplication of a segment of 
BNA from within the insert (marked * on Figure 3) composed of two 
contiguous Sau 3A fragments. 

30 Example 4 and 5 

Preparation of recombinant plasmids pMM 21 and pNM 22 

( Plasmids containing both the present leader sequence polynucleotide 

and the CPG^ structural gene) 

The 3»1 Md Bgl II fragment from Example 2 was cloned into the 

35 Bam HI site of pAT 153 and transformed into E coli W 5445, Two 

r s *f* 

Ap Tc Fol colonies were obtained, one containing a plasmid pKM 21 
in which the fragment was inserted in the opposite orientation to 
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pNM1 and one containing a plasmid pNM22 in which the fragment 
was inserted in the same orientation as pNM1. The restriction 
enzyme cleavage site maps of pKM21 and pHM22 are given in 
Figure b 4 and 5 respectively. 
5 The two plasmids, pNM21 and pNM22 both transformed E.coli 

to Pol , indicating that a pseudombnad promoter was present on 
the 3.1Md fragment. However, cells carrying the plasmid pNM21, 
in which the Bglll fragment was cloned in the opposite orientation 
to pNMl, exhibited more rapid growth with folic acid as the sole 

10 carbon source. This difference was clearly visible on agar medium, 
where colonies developed concentric yellow •halos 1 of precipi- 
tated pteroic acid, the insoluble product of folate hydrolysis. 

Confirmation that pNM21 gave enhanced expression of CPG^ 
over pNM22, was obtained by assaying enzyme production during 

15 batch growth of cells containing either plasmid. (The cells were 
grown in complex medium supplemented with 1% (w/v) glucose and 
where appropriate 0.0% (w/v) folic acid. The generation time 
was 56-66 min. The culture was sampled at hourly intervals and 
whole cells were disrupted by sonication. Enzyme activity waB 

20 determined in the centrifugal extract). Eesults are given in 
Table 2. 

The expression of CPG 2 from the plasmids pNK22 and pNM1 was 
2.5 units/litre of culture, representing 0.005# soluble protein. 
In contrast, expression from pNM21 was ?000-3500 units/litre of 
25 culture, which represented 4.7% soluble protein. As the cloned 
gene is inserted into the BamHT site of pAT15J, the observed 
higher expression of pSM21 is almost certainly due to transcript- 
ional read through from the Tc promoter. The low expression of 
CPG 2 carried on plasmids pNM and pM22 iB consistent with the 

30 view that Pseudomonas promoters function poorly in E.coli . It 
is also apparent from Table 2 that in the presence of folate 
there iB a two-fold increase in the specific activity of enzyme 
measured in cell sonicates. This phenomenon has been observed 
in all experiments, but doeB not seem to be associated with 

35 classical induction of the CPG 2 gene, as overall enzyme yield in 
the presence or absence of folate remains at about 3000 u/litre 
culture. It in fact reflects a consistent depression in the 
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soluble protein levels measured in sonicates from cells grown in 
the presence of folate. There is no obvious difference in growth 
rate of cells grown with folate and the reasons for this result 
are not clear. 

5 TABLE 2: CAEBOXYPEPTIDASE G PRODUCTION BY E COLI V5445 
, COOTATTCENG THE PLASMIDS pNM1, pNM21 and pNM22. 



CULTOEE CABBOXYPEPTXDASE G g SPECIFIC ACTIVITY (u/MG SOLUBLE 

PROTEIN) 



ACS 




pNM1 


pHM22 




pNM21 




(EE) 


-POL 


+P0L 


-POL +F0L 


-POL 


+P0L 


1 










11.5 


13.4 


2 










12.9 


9.6 


3 


.008 


.005 


.010 


.019 


13.9 


23-3 


4 


.009 


.011 


.015 


.013 


12.3 


26.9 


5 


.007 


.019 


.016 


.016 


11.5 


25.6 


6 


.005 


.024 


.014 


.023 


13-7 


24.1 


7 


.015 


.029 


.024 


.043 


13.2 


20.6 


8 


.013 


.028 


.024 


.046 


13.0 


23.6 



Expression of the cloned gene in Ps»putida 

The observation that the CPG^ gene was expressed in E.coli 
20 regardless of the orientation of the gene within the vector 

suggested that the promoter region of the CPG 2 gene had been cloned 
with the structural gene and the leader sequence polynucleotide. 
The low expression of CPG^ within E.coli from its natural promoter 
(pKM1, pNK22, pNM11l) confirmed other findings that Pseudomonas 
25 promoters are poorly recognised by B,coli BNA polymerases. It would 
be expected that if the gene was introduced back into a pBeudomonad 
cellular environment, then improved expression from the Pseud omonaB 
promoter should result. The 3.1 Md BgUI fragment was Bubcloned 
into the Pseudomonas cloning vector pKT230 at its single BamTrr site. 
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Two plasmidB were obtained, pNM31 and pNM32 representing the two 
possible orientations of the cloned gene. These plasmids were 
transformed into Ps.putida 2440 by the method of BagdaBarian and 
Timmis. Pseudomonad cells carrying both plasmids were cultured in 
5 minimal BaltB medium and enzyme production monitored. 

Yields of 500-1000 units/litre of culture were obtained regard- 
less of gene orientation within the plasmid. Specific activity of 
the enzyme in cell sonicates was 1.5 to 4.0 U/mg protein represent- 
ing 0.3 to 0.7# soluble protein (compared with < 0.05?b soluble 

10 protein in the donor strain RS-16). This result strongly indicates 
that the CPG^ promoter is present and operating in a pseudomonad 
background. When the same plasmids were transformed into E.coli 
W5445 12-40 Units/litre were found at specific activity <0.07 U/mg 
(<0.01% soluble protein). 

15 Periplasmic localisation of CPG^ 

There is evidence that CPG 2 is located in or near the peri- 
plasmic space of Pseudomonas Btrain ES-16. Pteroic acid, the 
product of CPG hydrolysis of folic acid is extremely insoluble 
and is found predominantly outside the cell in both liquid and solid 

20 media. Exogeneous pteroic acid is also seen in E.coli cultures 

containing the cloned gene when folic acid is present in the medium. 
This is clearly demonstrated by the , halo t of precipitated pteroic 
acid observed around colonies carrying plasmids in which expression 
of CPG 2 is flrom the Tc promoter of pBR322 (eg pNM2l). 

25 The localisation of CPGg produced by E.coli cells carrying 

pNM21 was examined by the separation of cellular proteins into cyto- 
plasmic, periplasmic, and whole membrane fractions. As a control, 
levelB of three marker enzymes, alkaline phosphatase (periplasmic), 
glyceraldehyde-3-phosphate dehydrogenase (cytoplasmic) and NADH.O^ 

30 oxidoreductase (membrane-bound), were also determined. As can be 
seen from Table 3 9T% of the CPG 2 activity occurs in the periplasm, 
equivalent to the marker periplasmic enzyme, alkaline phosphatase. 
This confirms the presence in pNM21 of a leader sequence polynucleo- 
tide next to the CPG 2 gene that codeB for a signal polypeptide 

35 according to this in vention that promotes the secretion of CPG 2 
from the cytoplasm into the periplasmic space. 
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Carboxypeptidase synthesised in E.ooli 

The Bpecific activity of CPG 2 in crude cell extracts of cells 
carrying pNM21 was 50-fold higher than equivalent extracts from 
Pseudomanas strain ES-16. To determine whether the cloned gene= 
product in B.coli had the same properties as CPS^ from the 
pseudomonad, enzyme was purified from B.coli carrying pNK21, The 
specific activity of purified CPG 2 (single band SDS-PAGE) was 535 
U/mg of protein, which compares to 550 U/mg of protein from the 
pseudomonad. CPG 2 purified from E.coli clone pM21 co-chromato- 
graphed with GPGg from Pseudomonas strain ES-16 at a sub-unit 
molecular weight value of 42,000 daltons. fin valines using metho- 
trexate as substrate were 7.4 x 10"^ and a.0 x 10~ 6 M respectively. 
In addition, antiserum raised against the PBeudomonas enzyme 
indicated immunological identity between the B.coli and Pseudomonas 
CPG 2 , as a confluent precipitation line was formed on Ouchterlony 
double diffusion analysis* 



TABLE 3 

Localisation of Carboxypeptidase 



FRACTION 



CPG ft 



ENZYME ACTIVITY 

AP GAPUH NADHQX 



Periplasmic 


97.0 


97.1 


6.8 


O.25 


Cytoplasmic 


2.6 


2.3 


93 


8.4 


Kembrane-bound 


0.4 


0.6 


0.2 


89.1 



AP = Alkaline phosphatase 

GAPDH = Glyceraldehyde-3-phosphate dehydrogenase 

NADH0X = MADH.Og oridoreductase 
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Example 6 

Preparation of a recombinant plasmld containing both the present 
leader sequence polynucleotide and the B-GalactoBidase structural 

Plaamid pNM14 (Example 3) vas treated with Sau 3A (QATC) and 
the fragments were cloned into the Bam HI site of M13 mp7 template 
ENA (single Btranded UNA (Step A of Figure 6). The product carrying 
a 318bp Sau 3A fragment coding for the present signal polypeptide 
and the first 22 amino acids of CPG 2 (nucleotide sequence of this 
fragment shown in Table 4) was selected and made double stranded. 
The DM coding for the signal polypeptide (and the first part of 
CPG 2 ) was then excised as an Eco EI fragment. TMb Eco BI fragment 
was then cloned into the promoter cloning vector E.coli pMC1403 
(M.J. Casadaban et al y J Bacteriol, 1980, 143, 971 ), which carries 
only the structural gene (lac Z) for B-galactosidase (ie no 
promoter and no ATG. start codon) (Steps B and C of Figure 6) . 
PlasmidB were obtained in which the Eco EI fragment had inserted in 
both orientations, however, only those in which fusion of the CPG^ 
sequence to the B-galactosidase sequence had occurred (i) yielded 
a 0.34 Kb fragment upon digestion with BamEEI; (ii) enabled the host 
cell to hydrolyse the colourless lactose analogue, BCIG, and impart 
a blue colouration to colonies. The O.34 Kb BamT TT fragment has been 
recloned into M13mp7 and sequenced to confirm that fusion has 
occurred. The •precursor 1 fusion produced will consist of the 
signal peptide, the first 22 amino acids of CPG^, 6 amino acids 
derived from the M13mp7 and pMC1403linker units, and B-galactosidase 
from its 8th amino acid onward. 

Localisation experiments have been performed on cells carrying 
a plaamid coding for the 'fusion gene 1 where the cellular proteins 
have been fractionated into periplasmic, cytoplasmic and membrane 
fractions. In these experiments an organism (E f coli MG 1061 ) which 
is deleted for the lac Z gene was grown in phosphate medium (H.C. 
Neu et al, J Biol Chem, 1964, 240, 3685) and periplasmic enzymes were 
released from the harvested cells by conversion to spheroplasts. 
Separation of Boluble proteinB (cytoplasmic) from particulate 
proteinB (membrane band) was achieved by BoniPating the harvested 



0121352 

22 

spheroplasts and subsequent centrifugation at 100 t 000g for 1hr, 
to Bediment the cell membrane (T.J.Silhary et al, Proc Katl Acad 
Sci USA, 1976, 73, 3423). 

The results given in Table 5 demonstrate the presence of 50% 
of the B-galactosidase activity in the periplastic space. This 
result is in direct contrast to similar work involving fusion of 
other periplasmic protein signal sequences to B-galactosidase, 
vhere the fusion proteins are not exported, but become jammed in 
the membrane (P.J. BasBfordet al, J Bacterid, 1979, 139, 19 and 
S D Snr et al, J Cell, Biol, 1980, 86 , 70t). ■•- 

TABLE 4 

The Polynucleotide Sequence of the 318 bp San 3A Fragment from 
Recombinant Plasmid pMH14 

5 f - G ATC CAC GCA CTG AAG GCG CGC GGC 

AAG ACG CGC GGC GTG GCG ACG CTG TGC 

ATC GGC GGG GGC GU GGC ACC GCA CTG 

GCA CTC GAT TGC TAT AAG AAC CAT. GGC 

TGG GGA CGC CCG ACA ACA GGC GTC CAC 

CAG CTT TTT TCA TTC CGA CAA CCC GAA 

CGA ACA ATG CGT AGA GCA GGA GAT TCC 
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Table 4 (comtd) 
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10 



Ala 
GCC 

Phe 
TTC 

Lys 
AAG 

Ala 
GOT 



Met 
ATG 

lie 
ATC 

Val 
GTG 

Arg 
CGC 

Thr 
ACC 



Arg 
CGC 

Ala 

GCC 

Ala 
GCG 

Asp 

GAC 

Asp 

GAC 



Pro 
CCA 

Ala 

GCC 

Gly 
GGC 

Asa 
AAC 

Gla 
GAG 



Ser 
TCC 

Val 

GTG 

Thr 
ACC 

Val 

GTG 

Gin 
CAG 



lie 
ATC 

Leu 
CTG 

Ala 
GCC 

Leu 
CTG 

Pro 
CCG 



His 
CAC 

Ala 
GCC 

Leu 
CTG 

Phe 
TTC 

Ala 

GCC 



Arg 
CGC 

Thr 
ACC 

Ala 

GCC 

Gin 
CAG 

Tal 

GTG 



Thr 
ACA 

Ala 

GCC 

Gin 
CAG 

Ala 
GCA 

lie 
ATC 
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HB. This fragment carries the leader sequence coding for the 
signal polypeptide, a part of the CPG 2 structural gene coding 
for the first 22 amino acids of the protein, the ATG start 
codon, the CPG 2 ribosome binding site (AGGA.) and other components 
of the CPG 2 promoter region. 



TABLE 5 

Localisation of Signal Peptide - B-galactosidase Fusion 
Protein 







% LOCALISATION* 




CPG 










/B-GAL 


AP 


GAEDH 


NADHOX 


Periplasmic 


50.3 


97.3 


5.4 


0.4 


Cytoplasmic 


30.9 


2.5 


95-3 


8.2 


Membrane-bound 


18.8 


0.2 


1.3 


89.4 



CPG^/B-GAL 
AP 

GAPDH 



average results from 4 experiments 

Carboxypeptidase G^B-galactosidase fusion protein 
Alkaline phosphatase 

Glyceraldehyde->-phosphate dehydrogenase 
UADH.0 2 Qxidoreductase 
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ETarcple 7 

Preparation of a recombinant plasmid, containing froth the present 
leader sequence polynucleotide and the GPG^ structural gene, able 
to replicate in B.coli and S. cerevisiae 
5 A 2.03 kilobase BftfnF T fragment coding for the present signal 

polypeptide and the entire CPG^ molecule was cloned in both orient- 
ations into the BamT TT site of an E. coli/S, cerevisiae shuttle vector 
pR0G5 (B.F. Sherwood and B.K. Gibson, The Molecular Biology of Yeast, 
1979t Cold Spring Harbor Publications) to give plasmids pLECJ and 

10 pLEC4 (Figure 7). These plasmids were transformed into S. cerevisiae 
Btrain LL20 by the lithium acetate induced transformation method 
described by Ito et al. , J. Bact., 1985, 155 t 163. Yields equivalent 
to 10-20 units/litre of culture volume were obtained regardless of 
gene orientation within the plasmid. Specific activity of the enzyme 

15 in total cell extracts was 0*2-0^3V*Qg protein representing 0.005?£ 
soluble protein. This level of expression from the pseudomonad 
promotor in a yeast background is similar to the level found when the 
gene was reading from its own promotor in E.coli ( 0.01Jo soluble protein). 
Localisation experiments have been performed on yeast cells 

20 carrying the above plasmids by sphaeroplasting the cells using standard 
techniques described by J.B«J). Beggs, Eature, 1978, 275 , 105. Peri- 
plastic enzymes, localised outside of the cell membrane, were released 
when the cell wall was removed. The osmotic stabiliser (1.2K 
sorbitol) was then replaced by 0.1Itf Tris-HCl buffer, p5 7.3 containing 

25 0.2mK ZnCl^ to lyse the sphaeroplasts and the whole centrifuged at 

100,000 x g for 1 hour to separate proteins in the soluble cytoplasmic 
fraction from membrane bound proteins. The results in Table 6 
demonstrate the presence of 64?c of the CPG^ activity in the peri- 
plastic fraction and a further 16?o associated with the cell membrane 

30 fraction. 

. TABLE 6 

Localisation of GPG 2 in S. cerevisiae 

% CPG 2 activity 
Periplasinic 64 
Cytoplasmic 20 
Membrane bound 16 
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JX/6425/teuxope 

CLAIMS 

1. A recombinant DKA transfer vector comprising a leader 
sequence polynucleotide characterised in that the leader 
sequence polynucleotide codes for a signal polypeptide of 
formula I, 

5 Ket - Arg - Pro - Ser - lie - Eis - Arg - Thr - 

Ala - lie - Ala - Ala - Val - leu - Ala - Thr - I 
Ala - Phe - Val - Ala - Gly - Thr 

2. A recombinant IMA transfer vector according to claim 1 
characterised in that the leader sequence polynucleotide is 

10 of formula II, 
I 

5 - ATG CGC CCA TCC ATC CAC CGC ACA 

GCC ATC GCC GCC GTG CTG GCC ACC n 
GCC TTC GTG GCG GGC ACC - 3 ^ 

3. A recombinant Kv& transfer vector according to either 
15 claim 1 or claim 2 characterised in that the leader sequence 

polynucleotide is downstream of and in reading phase with a 
bacterial or a yeast promoter and a prokaryotic ribosome 
binding site. 

4. A recombinant DIIA transfer vector according to any one 
of claims 1 to 3 characterised in that the leader sequence 
polynucleotide is upstream of and in reading phase with a 
structural gene. 

5- A recombinant DKA transfer vector according to claim 4 
characterised in that the structural gene codes for human 
25 growth hormone, human insulin or human chorionic somato- 
mammotropin. 

6. A recombinant DKA transfer vector according to claim 4 
characterised in that the structural gene codes for S.coli 

p - galactosidase. 
JU 7. A recombinant DKA transfer vector according to claim 4 
characterised in that the structural gene codes for 

Pseudomonas carboxypeptidase G ? (C?G 2 ). 

8. A recombinant DliA transfer vector according to claim 7 
comprising a polynucleotide of formula 



20 
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5 . - 

Ala 
GCC 



1 

Met 
ATG 

10 
lie 

ATC 



Arg 
CGC 

Ala 
GCC 



Pro 
CCA 

Ala 
GCC 



Ser 
TCC 

Val 
GTG 



lie 
ATC 

Leu 
CTG 



His 
CAC 

Ala 

GCC 



Arg 
CGC 

Thr 
ACC 



Thr 
ACA 

Ala 
GCC 



Phe 
TTC 

Ljb 
AAG 



Val 
GTG 

Arg 
CGC 



20 
Ala 

GCC 



Asp 

GAC 



Gly 
GGC 

30 
Asu 

AAC 



Thr 
ACC 

Val 

GTG 



Ala 

GCC 

Leu 
CTG 



Leu 
CTG 

Phe 
TTC 



Ala 

GCC 

Gin 
CAG 



Gin 
CAG 

Ala 
GCA 



Ala 
GCT 

Lys 
AAG 



Thr 
ACC 

Thr 
ACG 



Asp 
GAC 

Leu 
CTG 



Glu 
GAG 

Glu 
GAG 



40 
Gin 

CAG 



Lys 
AAG 



Pro 
CCG 

50 
Leu 

CTG 



Ala 
GCC 

Val 
GTC 



Val 
GTG 

Asn 
AAC 



lie 
ATC 

lie 
ATC 



Glu 
GAG 



Thr 
ACC 



Gly 
GGC 



Thr 
ACC 



Gly 
GGT 



Asp 
GAC 



60 
Ala 

GCC 



Glu 
GAG 



Gly 
GGC 



lie 
ATC 



Ala 
GCC 



Ala 
GCT 



Ala 

GCG 



Gly 
GGC 



AAC 



Phe 
TTC 



70 
Leu 

CTC 



Glu 
GAG 



Ala 
GCC 



Glu 
GAG 



Leu 
CTC 



Lys 
AAG 



Asn 
AAC 



Leu 
CTC 



Gly 
GGC 



Phe 
TTC 



80 
Thr 

ACG 



Val 
GTC 

90 
Val 

GTG 



Thr 
ACG 

Val 
GTG 



Arg 
CGA 

Gly 
GGC 



Ser 
AGO 

Asp 

GAC 



Lys 
AAG 

Asn 
AAC 



Ser 
TCG 

lie 
ATC 



Ala 

GCC 

Val 
GTG 



Gly 
GGC 

Gly 
GGC 



Leu 
CTG 

Lys 
AAG 



lie 
ATC 

Leu 
CTG 

Tyr 
TAC 



100 
Lys 

AAG 

Leu 
CTG 

Leu 
CTC 



Gly 
GGC 

110 
Met 

ATG 

Lys 
AAG 



27 

Arg 
CGC 

Ser 

TCG 

120 
Gly 

GGC 



Gly 
GGC 

His 
CAC 

lie 
ATT 



Gly 
GGC 

Met 
ATG 

Leu 
CTC 



LyB 
AAG 

Asp 
GAC 

Ala 
GCG 
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Asn Leu 
AAC CTG 



Thr 
ACC 

Lys 
AAG 



Val 
GTC 

Ala 
GCC 



Pro 
CCG 



Phe 
TTC 



Arg 
CGC 



Val 
GTC 



130 
Glu 

GAA 



Gly 
GGC 



ABp 
GAC 



Lys 
AAG 



Ala' 
GCC 



Tyr 
TAC 



Gly 
GGC 



Pro 
CCG 



Gly 
GGC 



lie 
ATC 



140 
Ala 

GCC 



Asp 
GAC 



Asp 
GAC 



Lys 
AAG 



Gly 
GGC 



Gly 
GGC 



Asn 
AAC 



Ala 
GCG 



Val 
GTC 



lie 
ATC 



150 
Leu 

CTG 



His 
CAC 



Thr 
ACG 



Leu 
CTC 



Lys 
AAG 



Leu 
CTG 



Leu 
CTG 



LyB 
AAG 



Glu 
GAA 



Tyr 
TAC 



160 
Gly 

GGC 



Val 
GTG 



Arg 
CGC 



Asp 
GAC 



Tyr 
TAC 



Gly 
GGC 



Thr 
ACC 



lie 
ATC 



Thr 
ACC 



Val 
GTG 



170 
Leu 

CTG 



Phe 
TTC 

180 
Phe 

TTC 



Asn 
AAC 

Gly 
GGC 



Thr 
ACC 

Ser 
TCG 



Asp 

GAC 

Arg 
CGC 



Glu 
GAG 

Asp 

GAC 



Glu 
GAA 

Leu 
CTG 



Lys 
AAG 

lie 
ATC 



Gly 
GOT 

Gin 
CAG 



Ser 
TCC 

Glu 
GAA 



Glu 
GAA 



190 
Ala 

GCC 



Lys 
AAG 



Leu 
CTG 



Ala 

GCC 



Asp 

GAC 



Tyr 
TAC 



Val 

GTG 



LeU 
CTC 



Ser 
TCC 



Phe 
TTC 



200 
Glu 

GAG 



Pro 
CCC 



Thr 
ACC 



Ser 
ACC 



Ala 
GCA 



Gly 
GGC 



Asp 
GAC 



Glu 
GAA 



Lys 



Leu 

CTC 



28 

210 
Ser 

TCG 



Leu 
CTG 



Gly 
GGC 



Thr 
ACC 
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Ser Gly 
TCG GGC 



lie 
ATC 



Ala 
GCC 



Tyr 
TAC 



Yal 
GTG 



220 
Gin 

CAG 



7al 
GTC 



Asn 
AAC 



lie 
ATC 



Thr 
ACC 



Gly 
GGC 



lys 
AAG 



Ala 
GCC 



Ser 
TCG 



His 
CAT 



230 
Ala 

GCC 



Gly 
GGC 



Ala 
GCC 



Ala 
GCG 



Pro 
CCC 



Glu 
GAG 



Leu 
CTG 



Gly 
GGC 



Yal 
GTG 



Asn 

AAC 



240 
Ala 

GCG 



Leu 
CTG 



Yal 
GTC 



Glu 
GAG 



Ala 

GOT 



Ser 
TCC 



Asp 
GAC 



Leu 
CTC 



Yal 

GTG 



Leu 
CTG 



250 
Arg 

CGC 



ThT 
ACG 



Met 
ATG 



Asn 
AAC 



lie 
ATC 



Asp 
GAC 



Asp 

GAC 



Lye 
AAG 



Ala 
GCG 



Lys 
AAG 



260 
Asn 

AAC 



Leu 
CTG 



Arg 
CGC 



Phe 
TTC 



Asn 
AAC 



Trp 
TGG 



Thr 
ACC 



lie 
ATC 



Ala 

GCC 



Lys 
AAG 



270 
Ala 

GCC 



Gly 
GGC 



Asn 
AAC 



Yal 
GTC 



Ser 
TCG 



Asn 
AAC 



lie 
ATC 



lie 
ATC 



Pro 
CCC 



Ala 
GCC 



280 
Ser 

AGO 



Ala 

GCC 



fbrr 

ACG 



Leu 
CTG 



Asn 
AAC 



Ala 
GCC 



Asp 
GAC 



Yal 
GTG 



Arg 
CGC 



Tyr 
TAC 



290 
Ala 

GCG 



Arg 
CGC 



Asn 
AAC 



Glu 
GAG 



Asp 

GAC 



Phe 
TTC 



Asp 

GAC 



Ala 

GCC 



Ala 
GCC 



Met 
ATG 



300 
Lys 

AAG 



Thr 
ACG 



Leu 
CTG 



Glu 
GAA 



Glu 
GAG 



Arg 
CGC 



Ala 

GCG 



Gin 
CAG 



Gin 
CAG 



Lys 
AAG 



310 
Lys 

AAG 



Leu 
CTG 



Pro 
CCC 



Glu 
GAG 



Ala 
GCC 



29 
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Asp 

GAC 



Val 

GTG 



Lys 
AAG 



Val 
GTG 



lie 

ATC 



320 
Val 

GTC 



Thr 
ACG 



Arg 
CGC 



Gly 
GGC 



Arg 
CGC 



Pro 
CCG 



Ala 
GCC 



Phe 
TTC 



Asn 
AAT 



Ala 
GCC 



330 
Gly 

GGC 



Glu 
GAA 



Gly 
GGC 



Gly 
GGC 



Lye 
AAG 



lys 
AAG 



Leu 
CTG 



Val 
GTC 



Asp 
GAC 



LyB 
AAG 



340 
Ala 

GCG 



Val 
GTG 



Ala 

GCC 



Tyr 
TAC 



Tyr 
TAC 



Lye 
AAG 



Glu 
GAA 



Ala 
GCC 



Gly 
GGC 



Gly 
GGC 



350 
Thr 

ACG 



Leu 
CTG 

360 
Gly 

GGC 



Ser 
TCA 

Gly 
GGC 

Asp 
GAC 

Ala 
GCG 

Ala 

CGC 

Gly 
GGC 



Gly 
GGC 

Thr 
ACC 

370 
Gly 

GGC 



Leu 
CTG 

Lys 
AAG 

lie 
ATT 

Arg 
CGC 

Lys 
AAG 



Val 

GTG 

Asp 

GAC 

Lys 
AAG 



380 
Pro 

CCG 



Ala 
GCC 

Pro 
CCG 

Leu 
CTG 



Glu 
GAA 

Ala 
GCG 

Pro 
CCA 

Gly 
GGC 

390 
Glu 

GAG 



Arg 
CGC 

lie 
ATC 



Glu 

GAG 

Ala 
GCC 

Val 

GTG 

Phe 
TTC 

Tyr 
TAC 

400 
Arg 

CGC 

Met 
ATG 



Arg 
CGC 

Tyr 
TAC 

lie 

ATC 

Gly 
GGC 

Val 
GTG 



Leu 
CTG 

410 
Asp 

GAT 



Thr 
ACC 

Ala 
GCC 

Glu 
GAG 

Tyr 
TAC 

Asp 
GAC 

Tyr 
TAC 

Leu 
CTG 



Gly 
GGC 

Ala 
GCG 

Ser 
AGC 

His 
CAC 

lie 
ATC 

Met 
ATG 

Gly 
GGC 



Gly 
GGC 

Leu 
CTC 

Leu 
CTG 

Ser 
AGC 

Ser 
AGC 

Ala 

GOT 

Ala 
GCC 



TGA - y 
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50 

9* A recombinant DKA transfer vector according to any- 
preceding claim characterised in that the transfer vector is a 
plasmid. 

10, A recombinant DKA transfer vector according to claim 9 
5 whenever taken together with claim 7 having the designation 
pEI'11, pKK111, pEK14, pKK21, pEK22, pHB31 f pKE32 or pI2C3. 
11* A microorganism transformed "by a transfer vector 
characterised in that the transfer vector is a recombinant 2JKA 
transfer vector according to claim 1 . 
10 12. A microorganism according to claim 11 characterised in 

that the transfer vector is a recombinant UEA transfer vector 
according to claim 4. 

13. A microorganism according to either claim 11 or claim 12 
which is a bacterium of the species B.coli, Pseudomonas or 

15 Bacillus or a yeast of the species Saccharomyces cerevisiae. 

14. A process for the preparation of a gene product characterised 
by 

(a) culturing a microorganism according to claim 12 in a 
culture medium to produce the gene product in the culture 

20 medium or the periplastic space of the microorganism, and 

(b) isolating the gene product from the culture medium 
or the periplasmic space of the microorganism. 

15. A process according to claim 14 characterised in that the 
gene product is Pseudomonas carboxypeptidase or 3.coli 

£ - galactosidase. 
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